When non i.i.d. information sources can be communicationally useful? 
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Information sources used for communication purposes usually are assumed to be i.i.d. type, 
especially as far as entanglement or nonlocality properties are concerned. Here we proposed simple 
scheme for detection of nonlocality of non i.i.d. sources based on the program idea which can 
be useful in communication tasks including quantum cryptography and communication complexity 
reduction. The principle of the scheme is rather general and can be applied to other problems like 
detection of entanglement coming from non-i.i.d sources. 



Introduction .- The problem of locality vs nonlocal- 
ity" was usually treated under the assumption that the 
information source is one which emits signals that are 
independent from each other. Such sources can be de- 
scribed by a sequence of identical and independently dis- 
tributed (i.i.d.) random variables. However there may 
exist sources for which the assumption may not work for 
purely physical reasons. The main idea of the paper is 
to propose a simple scheme for detection of nonlocality 
via Bell inequalities (or detection of quantum entangle- 
ment in terms of witnesses) in (in general multipartite) 
systems which arc non i.i.d, but have some type of classi- 
cal (but not quantum) memory. Let us mention the first 
approach to non i.i.d. sources was proposed in Ref. [ilj in 
terms of entropic quantities and then developed for en- 
tanglement measures [2] and classical capacities (see 
also Ref. Q). Here however we are focused on qualitative 
detection of quantum properties like nonlocality and en- 
tanglement. In particular the nonlocality detection needs 
to be more rigorous. The key notion - in contrary to stan- 
dard approach to nonlocality - is the program that is in- 
dependent from Alice and Bob settings (that are chosen 
randomly) and allow to subselect the substrings of the 
,, truly" nonlocal (or ,, truly" entangled ) data. Usually 
we shall assume that the source has no prior information 
about Alice and Bob settings choices. We show how this 
assumption can be relaxed in some cases. We discuss the 
result for bipartite case, however the scheme works also 
for multipartite versions. 

Scenario and the main idea.- In bipartite version the 
scenario we consider involves spatially separated Alice 
and Bob and a source S. The source is not specified, apart 
from the assumptions, that for each N it provides Alice 
and Bob a sequence of N objects each of which both of 
them can measure, and it does not have access to their 
settings. The task of Alice and Bob is to check if the 
source can be turned into a one which does not admit an 
LHV model. 

Let us remind that in particular the source need not 
lead to independent and identically distributed random 
variables, as assumed in usual Bell inequality tests. We 
are rather closer to the new paradigm that has been con- 



sidered only recently in f^, where the source can be to- 
tally empowered by the eavesdropper Eve. However we 
do not adopt the idea of symmetrization used there, since 
for certain patterns of memory the source which is quite 
useful, would be claimed to be useless, as admitting an 
LHV model. Exemplary source is the one which is 'lo- 
cal' and 'nonlocal' alternately, corresponding to e.g. an 
unstable device. 

The idea is quite simple. If Alice and Bob observe, 
that their data do not violate given Bell inequality, they 
are not yet lost. They can try to design a proper pro- 
gram such that after processing of their (classical) data, 
they will observe the violation. By program we mean 
any algorithm written for any (classical, quantum, hy- 
brid or other) machine that filters some events i.e. in- 
structs which outcomes of measurements should be taken 
into account, and which to be discarded. By proper, we 
mean that the program is independent from the settings 
(choices of measurements involved in Bell inequality test) 
but in principle it may depend on some other data like 
some additional measurement results (as long as they are 
settings- independent too) or even laboratory conditions, 
weather etc. This aspect is common with the original 
concept of hidden nonlocality @, The later is in- 
cluded in our model as a special case, however in the 
present analysis we shall focus on programs dependent 
on Bell experiment events only, which is still vital since 
we deal with non i.i.d. sources. 

In other words, if Alice and Bob have access to a source 
S which appears to be not directly useful, sometimes 
they can transform it into a useful one. It can be written 
symbolically as S' — S+P where P is their program which 
'filters' objects coming from S. 

To make the work with programs more easy, we first 
allow for Alice and Bob to transform their data into a 
binary string, which is called g. The more I's in the 
string, the more probable it is to violate the Bell inequal- 
ity. Hence the program P should be chosen such that it 
instructs about as much of those entries of the string g 
which have 1, as it is possible, without being dependent 
from the settings in Bell experiment. A substring of g 
taken in positions indicated by P forms a string called g'. 
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One may be also interested what happens if the program 
is somewhat correlated with the settings. As we shall 
show if the string g' is compressible enough, i.e. has low 
Kolmogorow complexity, and observed violation on g' is 
high enough, we still may exclude LHV in original source. 

In what follows, the letter in bold such as g will denote 
strings. The letters with indices such as gi will denote 
particular values of the i-th entry of the string. A bold 
capital letter will denote random variable. 

Sequence g- the idea of binary G function.- Consider 
an arbitrary linear two-partite Bell inequality, for correla- 
tions of local outcomes observed at measurement stations 
of Alice (A) and Bob (B) in the following form which 
avoids absolute values: 

^ a{x,y,a,b)P{a,b\x,y) < R. (1) 

Here x and y stand for the measurement settings chosen 
by Alice and Bob respectively, and a and b for the out- 
comes of their measurements, P{a, b\x, y) represents con- 
ditional probabilities and R is the local realistic bound. 
Note that any linear Bell inequality can be brought to the 
above form written in such a form with a{x, y, a,b) > 
since each inequality which has some negative a's can be 
rewritten by substituting probabilities which stand by 
negative a's by unity minus the probability of the oppo- 
site events. With further notation 

a{x, y, a, b) = P{x, y)C{x, y, a, b)G{a, b, x, y) (2) 

where C{a,b,x,y) is any real, positive function and 
G{a,b,x,y) £ {0,1} ([T|) becomes 

^ ^ C(a, b, X, y)G{a, b, x, y)P{a, b, x,y) < R (3) 

x,y a,b 

When one wants to experimentally find the value of 
the LHS of ([3]) one does N runs of the experiment and 
approximates 

P(a, 6, X, y) « ^a„a<5^.,b'^x„x(?y„y 

where variables with index i correspond to the values 
obtained in the i-th run of the experiment. When we 
plug this to ([3]) we get 

N 

C^G, < RN (5) 

1=1 

with abbreviations Ci = C{ai,bi,Xi,yi) and Gi =^ 
G{ai,bi,Xi,yi). The form (O is very useful since it is 
clearly seen which run of the experiment raises the value 
of the LHS (the ones with G = 1) so if one would like to 
make some postselection to increase the LHS one needs 
to choose these runs. The string of N values of G will be 
denoted as g: g= (Gi, G2, Gat). 



Choosing a substring with nonlocal properties - the 
idea of the d string .- Let us assume that the strings 
c=(Gi, Gat) and g=(Gi, Gat) be such that they do 
not violate ^ but there exists a subset of indexes i £l 
such that 

Y,C^G,<RN' (6) 

is violated, where N' is the cardinality of the set /. The 
full description of I can be done with the help of the 
string d G {0, 1}^ of length N, where [d]i = 1 if i S X 
and [d]i = if i ^ X. In what follows, string will be 
the output of program P. We will need also notation for 
substring of g indicated by d which will be denoted as 

g' — (Gd(l), Gd(Ar')). 

The idea of the main result . - From the construction of 
inequality ([5|) it is clear, that if the source admits LHV 
model, then the string g satisfies this inequality. How- 
ever, if g satisfies the inequality, the source that led to 
g may still violate the LHV model. We will provide now 
sufficient condition for this violation. Namely, the latter 
takes place if there is a binary string d such that sub- 
string g' of g taken at positions imposed by d violates 
inequality fSJ and d can be reproduced by a program 
which is independent from the settings. Of course, this 
fact should hold repetitively, that is Alice and Bob in 
order to test the source need to repeat the experiment 
k times for sequence of length N . The number k need 
not be large so that the probability that they can safely 
claim to see the source without LHV model is close to 1. 

The programs .- In practice of laboratory, by a pro- 
gram we will mean any sequence of instructions written in 
any language on a personal computer, a hybrid classical- 
quantum computer, or any other machine. What is only 
important is that the program instruct (each N) how 
to transform an input string g of length N into output 
N' length string g' formed from the subset of indices 1 
through N (one can think about the program as a tran- 
sition function of a deterministic Turing Machine) 
We will actually not need its particular form, but can 
consider it indeed as collection of some functions trans- 
forming input into output. Thus, we adopt the following 
definition: 

Definition 1 A program P is a sequence of algorithms 
with their inputs given by N -bit sequences g and the out- 
puts formed by binary N-bit sequences d. 

A program is independent from a random variable V if 
for each N the random variable P of its output ( induced 
by random variable of an input) is independent from V 
i.e. 

where wj^"*^ ranges the set of outputs of the program P 
of length N respectively. 
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There are two extreme examples of the programs, 
which clarify the idea of this (in)dependence defined 
above, as describe below. 

A negative Example.- Consider the following program: 

"for an input containing length-N g string 
output d = g . " . This program returns the string d 
which describes the substring g' with exactly all those 
values of g where function G indicates correlations in 
the experiment. It is therefore highly dependent from its 
input. Indeed from the output one can reproduce the 
whole input string g. Such a program is cheating from 
our point of view, as its output can not be taken into 
account in context of violation of the Bell inequality: a 
proper post-selection can always lead to its 'violation'. 

In what follows we will first focus on programs which 
are independent from the variable of settings. Let us note 
however, that if the input string is not correlated with 
the settings, the program (if it is short) can not be too 
much correlated with the settings. We develop this case 
in the reminder of this paper. 

A positive Example.- For positive case, consider a fam- 
ily of constant programs P which output the same string 
independently on their inputs. It describes what we shall 
call the simple program: 

"for input containing any N-length string d 
output the fixed binary string d" . 

We have then the following observation, the proof of 
which is obvious: 

Observation 1 Simple program acting on input g is in- 
dependent from the settings pC,YJ. 

Testing procedure and the main result.- Alice and Bob 
will perform certain testing procedure. If the procedure 
passes the test, the source with a high probability does 
not admit the LHV model. The parameter e > will 
capture accuracy of this test. The procedure will be just 
sampling performed on the blocks of signals taken from 
the source. A good event will be if a given block of length 
N violates inequality ([5]) by a constant r > tq with rg > 
describing the threshold of violation (i.e. one expects 
(i? + ro) on the LHS of (O). They will choose randomly 
k out of K blocks. If the amount of good events is sat- 
isfactory they accept the source as characterized by the 
program Pq, and abort the protocol otherwise. By satis- 
factory we mean that the rest of untested blocks (not all 
of which need to violate ([5])) still violate the inequality 
when Pq applied to them. 

Formally, Alice and Bob can proceed few times accord- 
ing to the following procedure: 

1. Produce binary string g of length N. 

2. Choose a program Pq which does not depend on 
settings (for usual Bell inequalities approach, Pq is just 
an identity permutation) 

3. Check if the output of Pq on g indicates N' length 
substring g' of g that violates inequality (O by r > tq > 
0, and abort otherwise. 



4. Take Ntotai = N x K more results from the source. 
Choose randomly k blocks of length TV each. For each of 
them perform step 3 and count the 'yes' answers. 

5. If the number kgood of 'yes' answers satisfies: 



accept, and reject otherwise. 

Let us check that the condition given above guarantees 
violation of the Bell inequalities by the untested blocks 
when Pg applied to them. The number Kgood of good 
events in remaining (untested) part of string of blocks 
can be estimated by sampling lemma ([HI) as Kgood > 
(%^ -e){K -k). By linearity of (O we can add K - k 
violation parameters obtained for each block processed 
by Pq. For the K — k — Kgood blocks we put the worst 
case, that there is not only no violation but just LHS of 
([5]) is zero. The violation of the untested string is then 
lower bounded by Kgood x (i? -I- ra)N' . Hence, by the 
above inequality, we obtain that the total violation of a 
string of length N' x [K — k) (the output oi K — k times 
processed blocks by Pq) has the LHS of the inequality 
bounded from below by 

^h^^,)(^K-k){R + ro)N' (9) 

To have violation of ^ for a string of length K — k, we 
need the above value to be not less than {K — k) x N' x R 
which gives the desired bound ([8]). 

What we aim to show is that we deal here with 'true' 
violation, i.e. that the program does not create itself the 
violating data, but rather extracts the violating string. 

We are ready to provide the main result of this paper. 

Theorem 1 // the source S passes the testing procedure 
that involves program Pq, the source S which has access 
to program Pq does not admit the LHV model with a high 
probability. 

Proof.- By the very assumption about sources, S is in- 
dependent from the variable of settings {X, Y). Since by 
point 1 of testing procedure Pq is also independent from 
settings, we can safely claim, that S with access to pro- 
gram Pq is a valid source, where by access we mean, that 
the output of S is filtered by Pq (only those signals which 
are idicating by string d are passed) . It is now sufficient 
to check, that Alice and Bob observe Bell violation from 
their block-wise postprocessing. It is easy to see that the 
testing procedure is independent from settings as well, 
and can be treated as a soubprocedure of Pq. By con- 
siderations below its definition (a consequence of points 
3 — 5), we obtain that source S equipped with a program 
Pq (or - equivalently - Alice and Bob with this program, 
having access to the source) violates the Bell inequality. 
The probability that the procedure succeeds approaches 
1 is exponentially fast in the number of tested blocks k 
of in testing procedure providing k £ 0{^/K). 
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Simple example .- An elementary example is a quan- 
tum source sending infinite sequence with even (odd) 
two-qubit system in maximally entangled |^'+) (sepa- 
rable (/ — |5'+)(5'+|)/3) state. Observer testing stan- 
dard CHSH inequality will get averages of separable two- 
qubit Werner state which will obviously obey the inequal- 
ity. However the following program P: "take d with 
[d]i = ([d]i — 1) for the index i even (odd)" will 
pass the above testing procedure with maximal violation 
of the inequality {R = 2\/2) which immediately the pres- 
ence of LHV in the source. 

Bounds on programs correlated with settings .- Some- 
times it may happen that the program P is correlated 
to the variable describing the settings. This problem is 
related to the possible restriction on ,,free will" of the 
observers which may lead to false violation of LHV, but 
still nontrivial LHV bounds may be derived (see [lO[). 
Here we shall derive the bounds in G-string type Bell in- 
equality for programs P assumed to be correlated with 
settings. For simplicity we shall consider the Bell-CHSH 
variant, however the idea naturally extends to any Bell 
inequality for which the rate of the outcomes implying 
G = 1 to all outcomes maximized over all setting choices 
and hidden variables is bounded away from 1. 

Assuming that there is some correlation between hid- 
den variable A and the settings we get for fixed A Bell 
inequality component: 

1 

^ P{x, y\\)P{a ® b = xy\x, y) < B (10) 

a:,y— 

where now i? e [f , 1] depending on correlations between 
settings and A, but not on the A itself is to be found. Note 
that of course here still Y^y^ P{x, y\X) = P{x, y) = j since 
the source has no influence of observers choices. The best 
strategies (LHV functions) for the source are those which 
make the sum in LHS of (|10p as large as possible, t is 
not hard to see that the best source strategy to maximize 
LHS of (fTO]) for given four component vector P{x,y\X) is 
the one which multiplies by P{a (B b = xy\x,y) = the 
least component of the vector. This immediately gives 
the bound 

B = l-r, r= min {P(a;,y|A)} (11) 

Now for given B or - equivalently - r let us estimate the 
mutual information /(A : XY) = H{XY) - H{XY\\) 
form below. Clearly H{XY) = 2 while for fixed r 
H{XY\X) is maximized by the vector P{xy\X) with all 
the components other than r equal to each other: 

H{XY\X) < -rlogr - (1 - r) log = /(r) 

which gives /(A : XY) > 2 — ,f{r). It means that if 
the local realistic bound is _B = 1 — r then the source 
has at least 2 — /(r) information about the settings. On 



the other hand, since / is monotonously increasing, if 
the source has / information about the settings then the 
local realistic bound is at most 

B<l-f-\2-I) = B{I) (12) 

Putting this bound into (jlOp and averaging it over A's 
gives the new inequality involving via information / the 
correlations between the hidden parameter and the set- 
tings: 

1 

P{x, y)P{a © b = xy\x, y) < B{I) (13) 

x^y—Q 

with the critical amount of correlation information the 
source can have (ie. the one reaching the quantum 
bound) hr^t = 2 - /(I - Pqs) = 2 - /(^ - ^) « 0.046. 

Note that the parameter A that saturates the latter 
bound can be interpreted as an informational ,, content" 
(or ,, capacity") of correlations contained in the source 
that correspond to quantum behavior. If applied to the 
quantum state 4" it can be regarded as classical hidden 
simulation of correlation properties of the state with re- 
spect to the considered Bell experiment. The role of 
this parameter (or - more precisely - equivalence class 
of such parameters) and its possible optimization over 
set of more experiments will be considered elsewhere. 

For the source with fixed I note that any program P 
producing the string of length A'^' modifies the inequality 
(fT3l) leading for large N' to: 

C^G^ < B{I)N' (14) 

Now consider the A^ length binary string d which is bi- 
nary characteristic function of substring g' of g ie. it has 
zeros on the positions which were rejected by the program 
and ones on those positions which carry those elements of 
g that were chosen to form g'. Suppose that Kolmogorow 
complexity of d is M. It is clear that d cannot contain 
more than M information about the settings. However 
this is the only information that is ,, moved back" to the 
source in the proof of the Theorem 1. Since the settings 
form string of i.i.d., the information the d holds about 
each individual settings follows 

<M (15) 

iex 

where U is the information d holds about the settings 
in i-th round of experiment. The use of the program 
P for the string from local realistic source is equivalent 
to another local realistic source which has this program 
built in. But even though it does not meet the freedom of 
choice requirement it still leads to local realistic bound: 

Ya,G,<YB{h) (16) 
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Finally note that since B{I) is a concave function one 

has YliiS:! ^{^i) — B{w)-^' ■ gi'^i'^g another inequality 
which is of our central interest: 

Y.C.G.<(l-r\'2-^))N' (17) 

It is noteworthy that this gives a bound on CHSH re- 
gardless of the program P. So there are some procedures 
which lead to the bounds that even QM cannot violate. 
Our approach easily gives these bounds. 

Conclusions .- We have provided natural method for 
extracting nonlocality properties form the sources that 
are non i.i.d.. If there exists some even long term mem- 
ory in the source the nonlocality (or entanglement) can 
be detected even if the standard tests fail. It is interesting 
that, as we have shown, in the case of nonlocality it is pos- 
sible to relax the condition about no prior source knowl- 
edge about observers settings. Possible trade off be- 
tween that knowledge and the power to perform commu- 
nication tasks like communication complexity reduction 
or cryptography is an interesting subject for further re- 
search. One can also consider programs with probabilis- 
tic algorithms built-in, especially in context of sources 
with Marcovian memory. These interesting refinements 
of our scheme will be considered elsewhere. Note that 
the present scheme is quite general and may be applied 
to multipartite sources and Bell inequalities. Its princi- 
ples may be also used for entanglement detection and its 
applications like distillation and cryptography. 
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